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Abstract. The evolution of the largest component has been studied intensely in 
a variety of random graph processes, starting in 1960 with the Erdos-Renyi process. 
It is well known that this process undergoes a phase transition at n/2 edges when, 
asymptotically almost surely, a linear-sized component appears. Moreover, this phase 
^"n transition is continuous, i.e., in the limit the function f(c) denoting the fraction of 

vertices in the largest component in the process after cn edge insertions is continuous. 
A variation of the Erdos-Renyi process are the so-called Achlioptas processes in which 
i— i in every step a random pair of edges is drawn, and a fixed edge-selection rule selects one 

of them to be included in the graph while the other is put back. Recently, Achlioptas, 
D'Souza and Spencer [T] gave strong numerical evidence that a variety of edge-selection 
rules exhibit a discontinuous phase transition. However, Riordan and Warnke very 
recently showed that all Achlioptas processes have a continuous phase transition. In this 
work we prove discontinuous phase transitions for a class of Erdos-Renyi-like processes 
in which in every step we connect two vertices, one chosen randomly from all vertices, 
and one chosen randomly from a restricted set of vertices. 

o 

1. Introduction 

In their seminal paper [8] from 1960 Erdos and Renyi analyze the size of the largest component in 
the prominent random graph model G n ^ m , a graph drawn uniformly at random from all graphs on 
n vertices with m edges. For any graph G let L\{G) denote the size of its largest component. We 
say that an event occurs asymptotically almost surely (a.a.s.) if it occurs with probability 1 — o(l) 
as n tends to infinity. 

Theorem 1 (|8j). For any constant c > the following holds. 

• If c < 0.5, then a.a.s. Li(G n ^ cn -\ ) = O(logn). 

• If c > 0.5, then a.a.s. Li(G n ^ cn ^ ) = £l(n), and all other components have O(logn) vertices. 

This result can be seen from a random graph process perspective. Starting with the empty graph 
on the vertex set [n] := {1, 2, . . . , n}, we add a single edge chosen uniformly at random from all 
non-edges in every step. It is not hard to see that the graph after inserting m edges is distributed as 
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Figure 1. Evolution of the largest component over the first n edges of Erdos- 
Renyi process, min-product and min-sum rule and the half-restricted process with 
parameters 0.25, 0.5 and 0.9 on 20 million vertices 



G n ^m- In this context Theorem [T] states that, asymptotically, a linear-sized component (a so called 
'giant') appears around the time when we have inserted n/2 edges. We say that the Erdos-Renyi 
process has a phase transition at n/2. 

This phase transition has been studied in great detail (for a survey see Chapter 3 in |9j). It is known 
that Li(G 

n,cn) a.a.s. satisfies Li(G niCn ) — (1 + o(l))/(c)n for some continuous function /(c) with 
/(c) = for every c < 0.5 and hm £ ,_ > o.5+ /(c) = 0. Thus, the phase transition in the Erdos-Renyi 
process is continuous^ (see Figure [l]). 

A variant of the Erdos-Renyi process which gained much attention over the last decade, mostly 
concerning the question whether one can delay or accelerate the appearance of the giant component 
[21 [12], is given by the so-called Achlioptas processes. In this variant one starts with the empty graph 
on the vertex set [n], and in every step gets presented a pair of edges chosen uniformly at random 
from all pairs of non-edges in the current graph. A fixed edge-selection rule then selects exactly one 
of them to be inserted into the graph while the other is put back into the pool of non-edges. 

Recently, Achlioptas, D'Souza and Spencer pQ provided strong numerical evidence that the min- 
product (select the edge that minimizes the product of the component sizes of the endpoints) and 
min-sum rule (select the edge that minimizes the respective sum) exhibit discontinuous transitions, 
in contrast to a variety of closely related edge-selection rules, in particular the ones analyzed in 
|12j . The authors conjectured that a.a.s. the number of edge insertions between the appearance 
of a component of size y/n and one of size n/2 is at most 2n 2 / 3 = o(n), that is, at the phase 
transition a constant fraction of the vertices is accumulated into a single giant component within a 
sublinear number of steps (see Figure [T]). This phenomenon is also called explosive percolation and 
of great interest for many physicists. Thus, a series of papers has been devoted to understanding 
this phase transition (see e.g. [21 IH OH] ) > most of the arguments not being rigorous but supported 
by computer simulations. 

Countering the numerical evidence it was claimed in [5] that the transition is actually continuous. 
Recently, Riordan and Warnke in [IT] indeed confirmed this claim with a rigorous proof. In fact, 
their argument shows continuous phase transitions for an even larger class of processes. A random 
graph process (G(T))t>q is called an ^-vertex rule if G(0) is empty, and G(T) is obtained from 
G(T — 1) by drawing a set Vr of £ vertices uniformly at random and adding only edges within Vr, 
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where at least one edge has to be added if all vertices in Vr are in different components. Observe 
that every Achlioptas process is a 4-vertex rule. 

2. Our Results 

In this work we prove that a variant of the Erdos-Renyi process which we call half-restricted process 
exhibits a discontinuous phase transition. Intuitively speaking, a discontinuous transition can only 
occur if one avoids connecting two components that are already large. The idea to avoid this in an 
Achlioptas process is to present two edges and choose the one that connects smaller components, 
which essentially is what the min-product and min-sum rule do. However, we pursue a different 
approach here. We connect two vertices in every step, but we restrict one of them to be within the 
smaller components. 

For every < /3 < 1 and every labeled graph G with n := |V(G)| we define the restricted vertex set 
Rp{G) to be the [(3n\ vertices in smallest components. Precisely, let v\, V2, ■ ■ ■ ,v n be the vertices 
of G sorted ascending in the sizes of the components they are contained in, where vertices with the 
same component size are sorted lexicographically. Then Rp{G) := {i>i,i?2, • • • ^[f3 n \}- 

The half- restricted process has a parameter < (3 < 1 and starts with the empty graph Hp(0) = 
lZ nt p(Q) on the vertex set [n]. In every step T > 1 we draw one unrestricted vertex v% 6 [n] uniformly 
at random and, independently, one restricted vertex V2 6 Rp(TZp(T — 1)) uniformly at random. We 
obtain TZp(T) by inserting an edge between v% and V2 if the edge is not already present (in which 
case we do nothing). Note that for (3 < 1 the half- restricted process is not an £- vertex rule. 

For a half-restricted process let ay be the random variable that denotes the maximum size of all 
components that the restricted vertex can be drawn from in step T. Clearly, ay is increasing in T. 
For every positive integer k we denote the random variable for the last step when ax is still below 
k by 

Tfe := max{T : ay < k} . 

Our main result is that for any parameter d < 1 the half-restricted process exhibits explosive 
percolation. Thus, even though Figure [T] suggests that the phase transitions of the min-product 
(or min-sum) rule and the half-restricted process behave similarly, their mathematical structure is 
fundamentally different. 

Theorem 2 (Main Result). Let < /3 < 1. For every K = K(n) with ln(n) 1 ' 02 < K < n, every 
C = C(n) with 1 <C C <C ln(AT), and every e > we have a.a.s. that 

(i) Lx(Kp{T c )) < K, and 

(ii) LxiTZ^Tc + n/HC))) > (1 - e)(l - /3)n. 

Note that for K = ln(n) 2 and C = lnlnlnn the theorem shows that, a.a.s., the number of steps 
from the first appearance of a component of size ln(n) 2 to the appearance of a component of size 
((1 - /3)/2) • n is <D(n/ In In In Inn) = o(n). 

3. Proof of the Main Result 
In our proof we need a rather technical lemma introduced in the following. 

Let iV be a positive integer, and for every < i < N let X{ ~ Qeom{^ 1 ) be a geometrically 

distributed random variable and set X(a,b) := X^=a^* f° r ever y < a < 6 < A^. All subsequent 
statements and arguments are about sums of geometrically distributed random variables, but we 
will use that they have the following combinatorial interpretation to a coupon collector scenario with 
A^ coupons. (In a coupon collector scenario, we have a number of different coupons and repeatedly 
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draw one uniformly at random with replacement. We are interested in how often we have to draw 
until we have seen every coupon.) Observe that X{ is distributed like the number of coupons we 
need to draw while holding exactly i different coupons (waiting for the (i + l)st), and thus X(a, b) 
can be seen as the number of coupons we draw while holding between a and b different coupons or, 
equivalently, to obtain b — a + 1 coupons of a fixed subset of N — a coupons. 

Note that 

6 N-a 

E[X(a, b)} = V = N V ~ = N(H N - a -H N - b - 1 ) , (1) 

L — ' TV — i ^-^ i 

i=a i=N—b 

where H n = Ya=1 V* denotes the n-th harmonic number for every n > 1. The following lemma 
concerns the situation when a = N — w(l) and b = N — 2. In terms of coupon collector, we want to 
collect all but 1 coupon of a fixed subset of to (1) coupons. We show an exponential bound on the 
probability that the number of coupons we need to draw for this is much lower than its expectation. 

Lemma 3. Let k = k(N) = w(l) and s = s(N) <C iVln(fc). Then for N large enough 

Pr[X(N - k, N - 2) < s] < e~ k °" . 

Proof. First note that by we have 

E[X(N-k,N-2)] = N(H k -H 1 ) k= = {1) (1 + o(l))Nln(k) . 

Thus, s K[X(N— k, N—2)]. Consider N different coupons and let {ci, C2, • • . , c^} be a fixed subset 
of them. We now draw s coupons with replacement and call the resulting set F. For every 1 < j < k 
let Yj be an indicator variable for the event that coupon Cj is not drawn within these s trials, i.e., 
Cj ^ F. By the comments preceding this lemma, the probability of the event X{N — k,N — 2) < s 
equals the probability that at most 1 of these coupons is not drawn within the s trials. Hence, for 
Y := Ylj=i Yj we observe that 

Pr[X(N - k, N - 2) < s] = Pr[y < 1] . (2) 
Using the identity 1 — x > e~ 2x which holds for all < x < 1/2 we have for every 1 < j < k that 



E[V y ] = Pr[l) ■ J.]- (l-l) S >e- 



and thus, for N large enough, 

2s 1 e s<iVln(fc) 

E[Y] >ke~N= k Nin(k) > 8^0.99 _ 

One can check that the random variables Yi, Y2, ■ ■ ■ , Yf. are negatively associated (see e.g. Chapter 
3 in [7]). Hence, we can apply Chernoff bounds to Y and obtain for N large enough that 



Pr[y < 1] f Pr \y < (l - -)e[Y]] < e- E I y ]/ 8 f 



1.0.99 

2/— 

which together with ([2]) finishes the proof. □ 
We now turn to the proof of Theorem [2] 

Proof (of Theorem^. We fix < 1, K = K[n) with ln(n) 1 - 02 < K < n, C = C{n) with 1<C« 
\n(K), and e > 0. To simplify notation we write TZ(T) instead of 1Zp(T). 

We first address (i). We need to show that at the step when the restricted vertex can for the first 
time be in a component of size C, there is a.a.s. no component of size larger than K. The main 
idea is that a large component needs to be drawn by the unrestricted vertex so often that this is 
unlikely to happen within Tq steps. 
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Note that up to step Tc two components of size at least C can never be merged by an edge since 
the restricted vertex in every step is drawn from vertices in components of size less than C. Hence, 
we can easily keep track of these components. We call a component A in 1Z{T) a chunk if it has 
size at least C. Let A\,A2,... denote all chunks in order of appearance during the process. By the 
pigeonhole principle, there can be at most n/C chunks. For every 1 < i < n/C we denote by Si the 
event that chunk Ai has size larger than K in lZ{Tc). We will show that Pr[£j] < 1/n for every 
1 < i < n. By applying the union bound this implies 

'n/C 

Pr[1Z(Tc) contains a comp. of size > K] = Pr 



It remains to bound Pr[£j] for every 1 < i < n/C. Let 1 < i < n/C be fixed for the remainder of 
the proof. Since for every T < Tc the restricted vertex is drawn from vertices in components of 
size smaller than C, the chunk Ai can grow by at most C in every step. Hence, the chunk has size 
at most (J + 1) • C before a vertex from the chunk is drawn for the jth time. Since moreover, a 
chunk has size at most 2C when it appears, a vertex from the chunk needs to be drawn in at least 
K/C — 1 steps after its appearance for Si to happen. Let X2, X4, . . . , Xk/c denote the number 
of steps between steps in which we draw a vertex from Ai. That is, X2 is the number of steps 
from the appearance of the chunk until a vertex from Ai is drawn for the first time, X3 is the time 
from that step until a vertex from Ai is drawn for the second time, and so on. Furthermore, let 

X:= Zf£Xj- Then, 

Pr[Si] < PrLY < T c ] , (5) 

and it thus suffices to show that Pi[X < Tc] < 1/n. Clearly, only the unrestricted vertex in every 
step can be in Ai, and thus, Xj is stochastically dominated by a geometrically distributed random 
variable with parameter jC/n. Hence, let Yj be a random variable with Yj ~ Qeom(jC/n) and set 

Y-=Ef£Yr Then 

Pr[X < T c ] < Pr[F < T C ] ■ (6) 

This is exactly the setup of Lemma [3] with N = n/C, k = K/C and s = Tc- Concerning the 
prerequisites of the lemma, we have k = w(l) since C <C \n(K) K. Furthermore, it is not hard 
to show that Tc < 4ra (see e.g. Lemma 3 and the remark following the proof in [11]) and thus 

»r, ,,s n , f K \ C<\n(K) , , ..nhi(K) C<^\n(K) 

iVln(fc) = --ln(-J = l j (l-o(l))— ±-L > An>T c = s. 

Hence, we can apply Lemma [3j which gives us for n large enough that 

Pr[Y < T C ] < e-W c )°-" . 
Since K > ln(n) 102 and C <C ln(K) we have 

Pr[y < T c ] < e" ln(n) = - . 

n 

Using Q, ([5]) and Q this settles the proof of (i), and it remains to prove (n), i.e., we have to show 
that TZ(Tc + n/ln(C)) contains a component of size (1 — e)(l — /3)n with high probability. 

We set a := n/(21n(C)) and split the proof into two parts. In the first part (the first a additional 
steps after Tc) we collect a suitable amount of vertices in components of size at least C, and in the 
second part (the remaining a steps) we actually build a giant component on these vertices. 

Consider the steps Tc + 1 to Tc + a in the graph process. Let U(T) denote set of vertices in 
components of size at least C in 1Z(T). We show a lower bound on the size of U(Tc + a) that 
holds with high probability. Note that by definition of Tc we have \U(Tc + 1)| > (1 — /3)n. If 
\U(Tc + a)| > (1 — (3/2)n, we have a bound that is by far good enough for our purposes, hence we 
restrict ourselves to the case \U(Tc + i)\ < (1 — (3/2)n for every 1 < i < a. 
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For every 1 < i < a let Xj := \U(Tc + i)\ — \U(Tq + i — 1)| denote the number of vertices added 
to the components of size at least C in the ith additional step. Furthermore, let X := Y2i=i-^-i- 
We lower bound the probability that Xi contributes at least 1 vertex. Clearly, this happens if the 
unrestricted vertex is drawn from components of size at least C, which by definition of Tq happens 
with probability at least 1 — 0, and if the restricted vertex is drawn from components of size smaller 
than C, which happens with probability at least 1/2 since we assume \U{Tc + i)\ < (1 — 0/2)n. 
Hence, for every 1 < i < a we have 

1-0 

Pr[X< > 1] > . 

Hence, K[X] > a(l — 0)/2 = (1 — /3)n/(41n(C)). In particular, X stochastically dominates a sum of 
a independent Bernoulli random variables with parameter (1 — 0)/2, such that we can use Chernoff 
bounds to obtain 

1 (l-p)n' 



Pr 



X < 



< e e( Mc)) = (i) 



2 41n(C) 

Hence, we can in the following condition on the event that 

\U(T c + a)\>(l-B)n+±=^n . (7) 

We now look at the second half of additional steps, i.e., steps Tq + a + 1 to Tq + 2a, and show that 
a.a.s. in these steps a sufficiently large component is created within U := U(Tc + a). Note that U 
is a fixed set of vertices which does not change from step to step. 

Assume that TZ(Tq + 2a) has no component of size (1 — e)(l — (3)n. We now show that a.a.s. this 
assumption leads to a contradiction. 

We call a step successful if it connects two components in U. Since every component in U has size 
at least C we have that 

(l-0)n 



C 



(8) 



successful steps will connect all components in U such that U forms one giant component of size 
at least (1 — 3)n. We now compute the probability to have a successful step if we do not have a 
component of size (1 — e)(l — /3)n. For a successful step, the restricted vertex needs to be in U 
which happens with probability at least 

\U\ - (l-0)n ^ 1-0 

0n - 80HC) ' 

and the unrestricted vertex needs to be drawn from a different component in U, which happens 
with probability at least 

n 

Hence, the probability to have a successful step is at least 

e{l-0? 



p :-- 



lln(C) 



Thus, the number T of successful steps stochastically dominates a sum of a independent Bernoulli 
distributed random variables with parameter p, and we obtain that 

^ rml n e(l-0) 2 e(l-0) 2 n 
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and by Chernoff bounds 



Pr 



T < 



C 



Pr 



Pr 



T < 1 



16/31n(C) 2 \\ £(1-/3) 



ln(C) 2 /C=o(l) 
< 

< e -E[T]/8 
j9j -fi( 



ln(C) : 



Pr 



r) 



r < -e[t] 



o(l) 



n 



(l-p)eCJ 



16/3 ln(C) s 



Hence, we a.a.s. have at least (1 — f3)n/C successful steps, which together with the considerations 
around (pi) contradicts the assumption to have no component of size (1 — e)(l — j3)n. □ 
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